A Novel Exploration/Exploitation Policy Accelerating Learning in Both Stationary and Non-Stationary Environment Navigation Tasks

Similar resources

Non-stationary Subtasks Can Improve Diversity in Stationary Tasks

Low diversity in a genetic algorithm (GA) can cause the search to become stagnant upon reaching a local optimum. To some extent, non-stationary tasks avoid this problem, which would be a desirable feature of GA for stationary tasks as well. With this in mind, we show that several methods of introducing artificial non-stationary elements help to promote diversity in a GA while working on an inhe...

Adaptive robot learning in a non-stationary environment

Adaptive control is challenging in real-world applications such as robotics. Learning has to be rapid enough to be performed in real time and to avoid damage to the robot. Models using linear function approximation are interesting in such tasks because they offer rapid learning and have small memory and processing requirements. This makes them suitable as adaptive controllers in nonstationary e...

Learning dynamical systems in a stationary environment

We consider the problem of learning the input–output relation of a dynamical system from noisy data. Our method rests on the use of a smooth simultaneous estimator which generalizes the standard empirical estimator. In a stationary environment, our algorithm is shown to select a model which exhibits the Probably Approximately Correct (PAC) property under very mild conditions. This contribution ...

Non-Stationary Policy Learning in 2-Player Zero Sum Games

A key challenge in multiagent environments is the construction of agents that are able to learn while acting in the presence of other agents that are simultaneously learning and adapting. These domains require on-line learning methods without the benefit of repeated training examples, as well as the ability to adapt to the evolving behavior of other agents in the environment. The difficulty is ...

Bargaining in a non-stationary environment

We study an alternating offers bargaining model in which the set of possible utility pairs evolves through time in a non-stationary, but smooth manner. In general, there exists a multiplicity of subgame perfect equilibria. However, we show that in the limit as the time interval between two consecutive offers becomes arbitrarily small, there exists a unique subgame perfect equilibrium. Furthermo...

Journal

Journal title: International Journal of Computer and Electrical Engineering

Year: 2015

ISSN: 1793-8163

DOI: 10.17706/ijcee.2015.7.3.149-158